Picture for Xue Yang

Xue Yang

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

Add code
Jun 01, 2026
Viaarxiv icon

OmniInteract: Benchmarking Real-World Streaming Interaction for Real-Time Omnimodal Assistants

Add code
May 26, 2026
Viaarxiv icon

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Add code
May 25, 2026
Viaarxiv icon

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Add code
May 22, 2026
Viaarxiv icon

PhotoFlow: Agentic 3D Virtual Photography Missions

Add code
May 22, 2026
Viaarxiv icon

SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation

Add code
May 21, 2026
Viaarxiv icon

SafeSteer: A Decoding-level Defense Mechanism for Multimodal Large Language Models

Add code
May 12, 2026
Viaarxiv icon

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox

Add code
May 11, 2026
Viaarxiv icon

SWIFT: Prompt-Adaptive Memory for Efficient Interactive Long Video Generation

Add code
May 10, 2026
Viaarxiv icon

Quantum Kernel Advantage over Classical Collapse in Medical Foundation Model Embeddings

Add code
Apr 27, 2026
Viaarxiv icon